DEC-22-S006 17:06 FROM : BSTZ 



714557334? 



TO:USPTO 



P.6'19 



Appl. No. 10/019,882 

Amdt. Dated December 22, 2006 

Reply to Final Office Action of October 26, 2006 

Amendments tojhej:iaims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 

Listing of Claims: 

1 . (currently amended) A method comprising: 

(a) calculating estimated weights for identified errors in recognition of utterances based 
on a reference string: 

(b) marking sections of the utterances as being misrecognized and associating the 
estimated weights with the sections of the utterances; and 

(c) using the weighted sections of the utterances to convert a speaker independent model 
to u speaker dependent model. 

2. (original) The method of claim 1, wherein parts (a) - (c) are repeated at least 

once. 

3. (original) The method of claim 1, wherein the utterances are converted into a 
recognized phone string a finn time through applying the speaker independent model and 
thereafter through applying the most recently obtained speaker dependent model. 

4. (previously presented) The method of claim 1 , wherein calculating the estimated 
weights comprises computing an average likelihood difference per frame and then computing a 
weight value by averaging the average likelihood difference over error words. 

5. (currently amended) The method of claim 1, wherein calculating the estimated 
weights comprises computing an average likelihood difference per frame according to equation 
(1) as follows: 



(1), 



"e ~ H b K * ~ K b 
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where Ml is a log likelihood of hypothesis word n, tf£ is a beginning frame index (in time), 
and ti " is an end frame index, and , and are counter parts for [[a]] the reference string. 



6. (previously presented) The method of claim 5, wherein calculating the estimated 
weights further comprising a weight for rnisrecugnized words of a particular speaker "i" 
according to equation (2) as follows: 



is a number of misrecognized words. 

7. (previously presented) The method of claim 1 , wherein for a particular speaker, 
different misrecognized words have different weights. 

8. (previously presented) A method comprising: 

(a) recognizing utterances through converting the utterances into a recognized siring; 

(b) comparing the recognized string with a reference string to determine errors; 

(c) calculating estimated weights for sections of the utterances; 

(d) marking the errors in the utterances and providing corresponding estimated weights to 
form adaptation enrollment data; and 

(e) using the adaptation enrollment daia to convert a speaker independent model to a 
speaker dependent model. 

9. (previously presented) The method of claim 8, wherein the utterances are 
converted into the recognized suing through applying the speaker independent model. 

10. (original) The method of claim 8, wherein parts (b) - (e) are repeated until 
differences between the reference and recognized strings are less than a threshold. 
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1 1 . (previously presented) The method of claim 8, wherein the utterances are 
converted into a recognized string a first time through applying the speaker independent model 
and thereafter through applying the most recently obtained speaker dependent model. 

1 2» (previously presented) The method of claim 8, wherein calculating the estimated 
weights comprises computing an average likelihood difference per frame and then computing a 
weight value by averaging the average likelihood difference over all error words. 

13. (currently amended) The method of claim 8, wherein calculating the estimated 
wioghta weights comprises calculating an average likelihood difference per frame according to 
equation (1) as follows: 

where is a log likelihood of hypothesis word n, u% is a beginning frame index (in rime), 
and#" is an end frame index, and and are counter parts for the reference string. 

14. (previously presented) The method of claim 13, wherein calculating the estimated 
weights comprises calculating a weight for misrecognized words of a particular speaker "i" is 
calculated according to equation (2) as follows; 

1 m 

W i = — * z\l"\ (2), wherein m 

n = 1 

is a number of misrecognized words. 

15. (previously presented) The method of claim 8, wherein foT a particular speaker, 
different misrecognized words have different weights. 

16. (currently amended) An article of manufacture comprising: 

a storage medium having instructions thereon which when executed cause a processor to 
perform operations comprising: 
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(a) calculating estimated weights for identified errors in recognition of utterances based 
on a reference string : 

(b) marking sections of the utterances as being misrecognized and associating the 
estimated weights with the sections of the utterances; and 

(c) using the weighted sections of the utterances to convert a speaker independent model 
to a speaker dependent model. 

17. (previously presented) The article of manufacture of claim 16 7 wherein parts (a) - 
(c) arc repeated at least once. 

18. (previously presented) The article of manufacture of claim 16, wherein the 
utterances are converted into a recognized phone string a first time through applying the speaker 
independent model and thereafter through applying the most recently obtained speaker 
dependent model, 

19. (previously presented) The article of manufacture of claim 16, wherein the 
estimated weights are computed through computing an average likelihood difference per frame 
and then computing a weight value by averaging the average likelihood difference over error 
words. 

20. (currently amended) The article of manufacture of claim 16, wherein an average 
likelihood difference per frame is used to calculate the estimated weights and is computed 
according to equation (1) as follows: 

Ln= n x 

//*-//£ K h 

where is a log likelihood or hypothesis word n, /f£ is a beginning frame index (in time), 
and He is an end frame index, and , R h and K are counter parts for [[a]] the reference string. 
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21 . (previously presented) The article of manufacture of claim 20, wherein a wcighi 
for misrecogniaed words of a particular speaker "i" is calculated according to equation (2) as 
follows: 

1 m 

w i : = — * (2), wherein m 

a number of misrecognized words. 

22. (previously presented) The article of manufacture of claim 16, wherein for a 
particular speaker, different inisrecognized words have different weights. 

23. (previously presented) An article of manufacture comprising: 

a storage medium having instructions thereon which when executed cause a processor to 
perform operations comprising: 

(a) recognizing utterances through converting the utterances into a recognized phone 

string; 

(b) comparing the recognized string with a reference string to determine errors; 

(c) calculating estimated weights for sections of the utterances; 

(d) marking the errors in the utterances and providing corresponding estimated weights to 
form adaptation enrollment data; and 

(e) using the adaptation enrollment data to convert a speaker independent model to a 
speaker dependent model. 

24. (previously presented) The article of manufacture of claim 23, wherein the 
utterances are converted into the recognized string through applying the speaker independent 
model. 

25. (original) The article of manufacture of claim 23, wherein parts (b) - (c) are 
repeated until differences between the reference and recognized strings are less than a threshold. 

26. (previously presented) The article of manufacture of claim 23, wherein the 
utterances are converted into a recognized string a first time through applying the speaker 
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independent model and thereafter through applying the most recently obtained speaker 
dependent model. 

27. (previously presented) The article of manufacture of claim 23, wherein the 
estimated weights are computed through computing an average likelihood difference per frame 
and then computing a weight value by averaging the average likelihood difference over error 
words. 

28. (currently amended) The article of manufacture of claim 23, wherein an average 
likelihood difference per frame is used to calculate the estimated weights and is calculated 
according to the equation (1) as follows: 

where 11 l is a log likelihood of hypothesis word n, is a beginning frame index (in time), 
and H" is [[a]] an end frame index, and tfL , rfj *!? *re counter parts for the reference string. 

29. (previously presented) The article of manufacture of claim 28, wherein a weight 
for misrecognized words of a particular speaker "i" is calculated according to equation (2) as 
follows: 

t 771 

W- t = — * II H (2), wherein m 

m n - 1 

is a number of misrecognized words. 

30. (previously presented) The article of manufacture of claim 23, wherein for a 
particular speaker, different misrecognized words have different weights. 
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